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Abstract 

We study the evolution of Slovenia's scientific collaboration network from 1960 till present with a yearly resolution. 
For each year the network was constructed from publication records of Slovene scientists, whereby two were con- 
nected if, up to the given year inclusive, they have coauthored at least one paper together Starting with no more than 
30 scientists with an average of 1.5 collaborators in the year 1960, the network to date consists of 7380 individuals 
that, on average, have 10.7 collaborators. We show that, in spite of the broad myriad of research fields covered, the 
networks form "small worlds" and that indeed the average path between any pair of scientists scales logarithmically 
with size after the largest component becomes large enough. Moreover, we show that the network growth is gov- 
erned by near-liner preferential attachment, giving rise to a log-normal distribution of collaborators per author, and 
that the average starting year is roughly inversely proportional to the number of collaborators eventually acquired. 
Understandably, not all that became active early have till now gathered many collaborators. We also give results for 
the clustering coefficient and the diameter of the network over time, and compare our conclusions with those reported 
previously. 
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1. Introduction 



The structure of social networks is paramount for understanding the spread of knowledge, cultural traits, disease, 
as well as many other entities and attributes that can be associated with individuals living in groups or societies. As 
such it has been the subject of inten s e investigation, both theoretical as well as ernpirical , for at least half a century 
(IWasserman and Faustl Il994i IWattsl Il999t iBarabasil l2002t IChristakis and Fowlerl l2009b . Primarily, and in many 
ways not really surprisingly, however, these investigati ons were in the domain of social rather than natural sciences. 
Probably best know in this context is the study by MilgramI ( 119670 . who studied how many steps it took, on average, 
to get a letter from a randomly selected person to a stockbroker in Boston, who was a friend of Milgram's. The result 
was six - a number that has since been reused outside of science for a number of purposes, one of the latest examples 
being the launch of SixDegrees.org seeking to exploit the "small-world phenomenon" for charitable purposes. A 
shortcoming of the study of Milgram, as well as that of many others conducted in a similar fashion, is that the size 
and structure of social networks mapped in such a direct and labor intensive way is rather small and receptive to 
bias. The advent of large-scale online portals made it p ossible to test the "six degrees of separation" hypothesis more 
thoroughly. Remarkably though, a study performed bv lLeskovec and Horvitz ( 2008 ). encompassing some 30 billion 
conversations from 240 million people, reported that the average path length among Microsoft Messenger users is 6.6. 
Although being closer to seven than six, the number is nevertheless in a strikingly good agrement with the result by 
Milgram obtained over 40 years ago. 



In natural sciences the attention to networks was sparked by works such as those of lWatts and Strogatzl ( 119981) and 
Barabasi and Albe rt (1999)' making ground-breaking advances with regard to our understanding of the "small- world 
phenomenon" and the emergence of scaling via growth and preferential attachment, respectively. The two works. 
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along with subsequen t refinements of the conc epts they introduced ( Barthelemv and Amaral , 1999 : jNewman et all 



2000: Krapivskv et al., 2000; Dorogo vtsev et al. , 2000t Dorogovtsev and Mendesl 200C : Amaral et al. , [20 00: Krapi vskv and Redner 



2001: Krapivskv et al... ,2001,). spawned an impressive number of s tudies on networks, as evidenced by the many re 
views JNewman . 2000l: Albert and Barabasi . 2002 : Dorogovtsev an d Mendes. 20021: NewmanL 20031: Boccaletti et al 



2006: Dorogovtsev et al., 2008) and books (Doro govtsev and Mendesl 120031: IPastor-Satorras and Vespignanil 12004 
Newman et al.., .2006; Barrat et al,. 2008) dedicated either specifically to this field of research or its many interdisci 



phnary variations. This is all the more imp ressive since, at least within the hard sciences, prior to the late 1990s a 
paper on network theory is hard to come by ( NewmanL 120091). For a field this young the volume of insightful findings 
that have accumulated until now is something to be re ckoned with . For example, the s t ructure of networks has been 
found crucial for their resi l ience to error and attack (lAlbert et al. . l2000L ICohen et al.[ . l2000L ICallawav et all l2000L 
Cohen et al.', '2001; 'Pietsch', '2006"), for the fast availability of informati on within the wor ld-wide-web ( lAlbert et al 



1999;Pastor-Satorras et al., 200 IJ, uninterrupted supply with electricitv ([Albert et al. 
and viral infections IPastor-Sa torras and VespignanA 2002 : Zanette and KupermanL 



2004) , fast spread of epidemics 



2002 : Barthelemv et all 12004 



Colizza et al l[2007h. robust and near flawless reproduction of organisms (lHartweUetallll999l). the evolution of co- 
operation ( Santos and PachecoL 2005 : Szabo and FathL 2007 : Perc . 2009!) and coe volution ([Gross and Blasius . 2008 : 



Perc and Szolnoki , 



201 oh , thedvnamics of social systems ( Caste llano et al. , 2009[) , and surely many other aspects of 



everyday life. 

An interesting and potentially very revealing sub set of complex ne two rks are the social networks, of which scien- 
tific collaboration networks are a beautiful example ( NewmanL 2001d[[bl[c . 2004). Notably, for a social network to be 
representative for what it stands - an account of human interaction - a consistent definition of acquaintance is impor- 
tant And_whn£jU^ to define a friendship or an enemy in a consistent and precise manner (MoodvL 



200lL[Moodv and Whitel [2003[) . scientific collaboration is accurately documented in the final product and thus fairf 



stra ightforward to as sess. Also amenable to a precise defin ition of connectedn ess are movie actors dAmaral et aU 
[200 0). electric grids ([Watts and StrogatzL [l998L [Albert et alli2004) and airports ([Ouimera et all [20051) . for example. 



yet these are either approximations of social networks in that they don't really document real human cont act or the 
level of acquaintance between people forming them is difficult to determine. As argued convincingly by Newmarj 
([200 Idl) , considering scientific collaboration networks alleviates these problems to a large extend. 

Here we study the evolution of a scientific collaboration network, namely that formed by Slovenia's scientists, 
from its very beginnings in the 1960s until the present time. Covering a time span of 50 years, the data are unique in 
that they provide an excellent testing ground for the "small- world" and preferential attachment hypotheses in growing 
social net works. We tackle these issues similarly as outhned in previous studies on growing scie ntific collaboration 
networks ("n ewmanLl2001atlJin et all 1200 ItlBa rabasi et all l2002tlJeong et alll2003[|MoodvLl2004l) , where it has been 
reported, for example, that the growth is governed by linear or sublinear preferential attachment, and that as the 
networks grow their average degree increases while the average distance between individuals decreases. Evidences 
for strong clustering and models describing the growth of social networks have been presented in this context as 
well. Notably, for a set of different yet static scientific collaboration networks, New man (200 Id) has shown that the 
average distance between different authors scales logarithmically with size. We come to results that are in agrement 
with these earlier observations, but for a single growing scientific collaboration network. Moreover, we show that 
the observed near-linear preferential attachment rate translates into the expected log-normal degree distribution fairly 
accuratel y ; a detail that was previously a so urce of some discrepancy not just in the context of scientific collaboration 
networks (iJeong et alll2003 : lRedneiL[2005l) . In the continuation we first give information on the raw data and network 
construction, while the results are presented and summarized in Sections 3 and 4, respectively. 



2. Preliminaries 



Slovenia is a small country located at the heart of Europe with a population of two million(3 It has a well- 
documented research history, which is made possible by SICRIS - Slovenia's Current Research Information Syste 
- hosting up-to-date publication records of all Slovene scientists. At present, there are 30630 registered, including 
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young and non-active scientists as well as laboratory personnel, which boils down the initial number to 8402 of 
those that are truly active research-wise or have been so in the past. By this we mean those that, to date, have at 
least one bibliographic unit indexed by the Web of Science. This criterium may be somewhat stringent, but it is 
the only one we could apply consistently. Moreover, since the publication data contain records not just of research 
but also of professional work and many other activities not necessarily concerning research, it is important to define 
a threshold for when two scientists are considered connected. Having given an interview together can hardly be 
compared to writing a joint research paper In accordance with previous studies, which by default considered databases 
containing almost exclusively research papers and where therefore the setting of such a threshold was not necessary, 
we here consider two scientists as being connected if, up to the given year inclusive, they have coauthored at least 
one research or review paper together A final factor slightly affecting the network size is the necessity of identifying 
authors on a full first and last name basis. Identification based on the last name and first initial only (< 1% of 
Slovene popu lation has more th an one initial) has already been shown to downsize scientific collaboration networks 
considerably (iNewmanL 1200 1 dh . Here this problem is additionally amplified by the fact that certain last names are very 
frequent, for example because a family is traditionally involved in research over many generations, but also because 
in any given country certain last names are far more common than others. In our case this introduces an unacceptably 
large bias, grouping too many different individuals together and making them appear as a single author. On the other 
hand, requiring full first and last name agreement for identification practically never confuses two different authors for 
one, but is likely to miss out a few links if authors do not use their first names consistently. Although all publication 
records in SICRIS are always given with full first and last names (irrespective of whether the actual paper features 
just the initials of the first name), sometimes authors are evidenced in SICRIS under a given first name but then use 
an abbreviated (e.g. "Alojzij" may become "Alojz") or internationalized (e.g. "Aleksander" may become "Alex") 
version thereof, and SICRIS is not always consistent in eUminating such discrepancies. Challenging for parsing are 
also the non- ASCII characters that are quite common in Slovene names, such as c, c, s and z, but this can be solved 
with a high success rate by converting them all to standard ASCII counterparts via a unique-enough rule. Taking 
these considerations into account, we start with 30 scientists with an average of 1.5 collaborators in the year 1960 
and end with 7380 scientists with an average of 10.7 collaborators in 2010 with a yearly time resolution in-between, 
assured that the error margin is highly unlikely to affect the presented results in any perceivable way. To date the 
7380 scientists forming the network in 2010 have written 76194 papers, altogether having 735619 citations assigned 
to them. A detailed statistical analysis of i ndividual scientific indicators, which is an interesting and vibrant subj ect on 
its own (lEgghe andRouss"eaiIll990l:lHirsch, 2005; Egghe. 2006. 2008;^chi-eiber.,2008b.a:. Rousseau and vj. 12008 : 



Schreibei{l2oTol) ^ is, however, presented in a separate study (lPerd.1201 



3. Results 

We start with presenting the evolution of fundamental statistical properties of the scientific collaboration network 
in Fig.[T] The inset of panel A shows how the network size increases with time. On a semi-log scale we could attempt 
making a linear fit for at least some time windows, yet the obtained exponents would not be very meaningful given 
that a slightly negative curvature is obviously present across the majority of the plot. We therefore satisfy ourselves 
with the conclusion that the network growth is almost exponential in time, which is certainly a result worth some 
admiratio n. Note that exponen tial growth is usually associated with online networks such as Delicious or Yahoo! 



Answers jLeskovec et al.L 120081) . but could hardly be expected for a scientific collaboration network. This result can 
be taken as evidence that science is well taken care of in Slovenia, but also that it is not yet at capacity in terms of 
the number of scientists that can be sustained. The main graph in panel A, on the other hand, shows the fraction of 
the largest component with respect to the total network size in the pertaining year. Initially the network was quite 
fragmented with the largest component occupying less than 40% of its overall size. In good 20 years, however, the 
largest component started exceeding 2/3 of the network. Currently it occupies 98.7%, meaning that the vast majority 
of Slovenia's scientists is connected with one another via some number of intermediate acquaintances. Those not 
belonging to this giant component typically form sm all isolated groups that don't crucially affect the structure of the 



network. As discussed already by NewmanI (2001d), this is a desirable feature since it signalizes that science as a 
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Figure 1 : Fundamental statistical properties of the scientific collaboration network over time. All panels have a horizontal axis break, marked by the 
vertical dashed line, such that the period from 1960 — 1984 occupies 20% (30% in insets) of the total axis span, while the period from 1985 — 2010 
occupies the remainder. The axis break is motivated by the emergence of the logarithmic variation of the mean distance with size (panel D) and the 
largest component exceeding 70% of the network (panel A). This also coincides with the settling of other network properties into predictable paths 
past the year 1985. (A) Size of the largest component in percentage of the total network size N. Dashed horizontal line marks the 70% borderiine, 
which is exceeded in 1985. The inset shows network growth, starting with N = 30 scientists in 1960 and ending with N = 7380 in 2010. Network 
growth is almost exponential in time, with the curve having a slightly negative curvature on a semi-log scale. (B) Clustering coefficient and the 
average number of collaborators z (inset) over time. Past the year 1985 z increases exponentially. (C) Mean distance between authors and the 
diameter (inset) over time. The tipping point is the year 1985, following which both quantities gain a downward momentum. (D) Mean distance 
between authors past 1985 in dependence on log N/ log z- Dashed red line is the best linear fit to the data while the solid green line is the best linear 
fit going also through the origin. 



whole is a product of joint and strongly interdisciplinary rather than isolated efforts. This, in turn, can be seen as na 
important driving force behind the ever faster scientific progress and innovation. 

The emergence of the giant component brings with it some noticeable changes in the way certain quantifiers of 
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the network structure evolve over time. In order to highlight this transition, each panel of Fig. [T]has a horizontal 
axis break giving visual priority to the 1985 - 2010 period, during which the giant component occupies more than 
70% of the network. The main graph in panel B shows that the clustering coefficient, calculated as thre e times 
the number of triangles divided by the number of connected triples of vertices [see e.g. iNewmanI (l2003h l. starts 
decreasing monotonously, while simultaneously the average number of collaborators per author (average degree of 
the network) assumes a steady exponential increase (see insert). Prior to 1985 the trends are less clear and fluctuating, 
but qualitatively similar Although the clustering coefficient is decreasing over the years, with around 20% chance of 
two scientist collaborating if both have done so with a third scientist in 2010, it can still be concluded that clustering 
is an inherent property of the studied scientific collaboration network. Altogether these results are in good agrement 



with e arlier findings obtained for larger scientific collaboration networks but over a shorter time span (iBarabasi et al 



20021) . Notably, large increments in the level of collaboration over an even longer time span then c onsidered here have 



been reported also by iGrossmanI ( 120021) for a scientific collaboration network of mathematicians dGrossman and Ion , 



19951). 



Most impressive, however, is the shift in the dependence of the mean distance between authors and the diameter 
of the network, as depicted in panel C of Fig. [T] (main graph and inset). The year 1985 constitutes a tipping point, 
following which the upward trend prior is replaced by a steady downward trend. However, since the diameter is 
obviously more prone to fluctuations than the mean distance, the tipping point in the inset is expressed less accurately, 
but still clearly. It is remarkable that in spite of the broad myriad of research fields covered by Slovenia's scientists, 
the networks still form "small worlds" in that the path between any given pair (provided it exists) leads across only a 
few - on average less than five but definitely not more than 14 in 2010 - intermediate acquaintances. It is common 
to compare thes e values with what would be obtained on a random graph having the same size and average degree 
(lBollobasl[l985l) . Indeed, random graphs can be considered as the simplest of models of social networks. Panel D of 
Fig.[T|shows how the average distance varies in dependence on \ogN/ logz (exactly the variation of the mean distance 
observed on random graphs) within the 1985 - 2010 period, along with the best linear fit (dashed red line) and the 
best straight line fit going also through the origin (solid green line). Evidently, there is a considerable overlap with 
the logarithmic variation that can be observed for random graphs, especially past the year 1995, where the points 
fall almost perfectly onto the solid green line, thus confirming the existence of the "small-world phenomenon" in the 
studied scientific collaboration network. Notably, this result is interesting on its own as an empirical demonstration of 
logarithmic variation with size in a real growing social network. 

In addition to the above presented results, the data enable testing the prefer ential attachment hypothesis in growing 
social networks. Growth and preferential attachment have been proposed bv IBarabasi and Albert (Il999h as the two 
crucial factors for the emergence of scaling in random networks. Notably, synonymous to preferential attachment 
are the terms cumulative advantage ( jde Solla Price , 1965. .1976.) and the Matthew effect (Merton. 1968) , which were 
proposed in the 60s but attracted considerably less attention. It is also interesting to note that the gener al evolutionar y 
theory of information production processes can be applied successfully o n evolving complex ne tworks ('E2gh4 2007 ), 
and that in this context the so-called "success-breeds-success" principle dSgghe and Rousseau . .1995. 2006) was used 
synonymously. In essence, however, these terms all describe that the more one has (of something), the more likely it 
is one will eventually gain even more; and conversely, that the ones that have very little are unlikely to recover Trans- 
lating this to growing scientific collaboration networks, it means that the more collaborators one has at a given time, 
the more likely it is one will attract new scholars in the future. On the other hand, if one has only a few collaborators, 
it is less likely s/he will establish new acquaintances. Indeed, this seems like a very reasonable proposition. 

Panel B of Fig. |2] shows how the number of collaborators increases over the years for every scientist considered 
in this study. Individuals are thereby enumerated according to their rank, as determined by the total number of 
collaborators they have in the year 2010. Number one is the scientist that in 2010 has the highest degree (equalling 
122), author number two has the second-largest degr ee (equallin g 119), and so on. The upper-most color stripe in 
Fig. 1^ (for the year 2010) thus gives the Zipf plot ( Zipl . Il949h of the number of collaborators in the year 2010. 
Since the Zipf plot can give vital clues on the expected distribution of the examined quantity, we show it in panel A 
of Fig. |2] separately on a semi-log and log-log (inset) scale. From the informetrics perspective it is noteworthy that 
the Lotka law, describing for example the frequenc y of pubUcatio n by authors, is often equivalent to the Zipf law in 
that it also assumes an inverse square form [see e.g. fegghe (2005) for an interesting treatment]. Had the Zipf plot on 
the log-log scale a linear outlay with the slope /3, this would be equivalent to a power-law distribution of the number 
of collaborators y of the form P(y) ~ y'>', with y = 1 + 1//?. Furthermore, this would then imply linear preferential 
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Figure 2: (A) Zipf plot of the number of collaborators versus the k-th ranked author on a semi-log (inset shows log-log) scale, as obtained for the 
year 2010. (B) Color-coded evolution of individual collaborator numbers leading to the Zipf plot in panel A. Each vertical line refers to a single 
author, whereby authors are ranked according to the total number of collaborators they have in the year 2010 (see panel A). The start of each 
vertical line corresponds to the year the pertaining author received his/her first collaborator, i.e. when s/he became active and thus a part of the 
collaboration network. The coloring denotes how the number of collaborators of each author increased over the years (from the time of becoming 
active till 2010), according to the color bar on the right. Not all authors that became active early eventually gathered many collaborators, although 
there is a fairly clear trend towards an earlier start when approaching the P' ranked author. This is evidenced by the solid black line, depicting the 
average starting years that were determined within non-overlapping windows each containing 300 consecutively ranked authors. 



attachment governing the growth of the studied network. Yet this is not the case since the Zipf plot on the log-log scale 
has an obvious negative curvature. The semi-log scale is similarly inconclusive, thus hinting towards an interesting 
degree distribution and the underlying attachment rate. 

Before determining this, however, it is worth examining how the year of becoming active translates to the number 
of collaborators one eventually acquires over the years. Although becoming active in the sense of writing the first 
research paper is not necessarily identical to becoming active in the sense of acquiring at least one collaborator (for 
the obvious reason that one can write a research paper as a sole author), this discrepancy is small enough to be 
negligible. The color map presented in panel B of Fig. |2] shows that the sooner one started acquiring collaborators, 
the more likely it is one has a lot of them in the year 2010. In other words, the age of a node is roughly inversely 
proportio nal to its degree, which is charact eristic for networks whose growth is governed by preferential attachment 
[see e.g. Ben-Naim and Krapivskvl (l2009h for a recent treatment]. However, for growing scientific collaboration 
networks this conclusion is obviously challenged for several reasons. Note that a sliding non-overlapping window of 
300 consecutively ranked scientists yields the average starting year as depicted by the black solid line in panel B, but 
individual discrepancies to this are far from uncommon. An obvious reason contributing to the deviation is that not 
all people are equally fond of making new acquaintances. This is true for life in general, and probably even more 
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Figure 3: (A) Distribution of tlie number of collaborators y on a log-log scale. Solid green line is the log-normal fit P(y) ~ exp[fl In y - fo(lny)^] 
of the data, with a = 0.69 and b = 0.46. (B) Attachment rate A(y) = Ay/y versus the number of collaborators on a log-log scale. Depicted is the 
average over 50 years, where y was determined in the year x and Ay in the year x + I for x = 1960, 1961, ... , 2009. Solid green line is the nearly 
linear fit A(y) ~ y/(l + c In y), with c = 1 .32. For comparison, dotted blue line is the sublinear fit A{y) ~ y^ , with g = 0.79, and the dashed red line 
is the linear fit A(y) ~ hy, with h = 0.053. While sublinear attachment rates give rise to s tretched exponential distributions I Krapivskv et al., 2000l), 
log-normal distributions arise from nearly linear preferential attachment <Redneill2005l) , thus making results presented in panels A and B in good 
agreement with one another 



SO for scientific collaboration. Moreover, a scientist may have gone dormant in the course of time, thus stopping 
the acquirement of new collaborators. Nevertheless, the statement that older scientists are more likely to have more 
collaborator than younger scientists is certainly reasonable and valid, and results presented in panel B of Fig.|2]support 
such a conclusion to a large extent. 

Turning now to testing the preferential attachment hypothesis, we first show in panel A of Fig. [3] the degree 
distribution of the network in the year 2010, i.e. the number of authors with a given number of collaborators. On 
a log-log scale the points exhibit, similarly as the Zipf plot in Fig. |2] a negative curvature across the whole span of 
the number of collaborat ors. Accordingly, a power-law fit, even with an exponential cutoff, cannot describe these 
data accurately [see e.g. IClauset etalJ (l2009h l. Indeed, it turns out that a log-normal form, as given in the caption 
of Fig. [3] is most fitting, as evidenced by the solid green line. Wh i le log -normal forms are typically associated 
with random multiplicative processes [see e.g. iKrapivskv and Rednen (l2007h l. recently Redner used it for fitting the 
cumulative citation distribution of the Physical Review over the past 110 years. There it was also pointed out that 
such distributions may arise from near-linear preferential attachment. To test this for our social network we have 
determined the average attachment rate A(y), giving the likelihood that an author with y collaborators in a given year 
will have Ay more collaborators in the next year, as specifie d in the cap t ion of Fig. |3] This way of measuring the 
preferential attachment in evolving networks was proposed bv lJeong et al.l (120031) . and used also for the assessment of 
preferent ial attachment i n a gro wing scientific collaboration network, albeit over a shorter time span than considered 
here, by iBarabasi et alJ (l2002h . Results presented in the panel B of Fig. [3] show that the data can be fitted best 
by the near-linear form A(y) ~ y/(l + clny) (solid green line). For comparison, a linear fit (dashed red line) and a 
"traditional" sublinear fit (dotted blue line) are shown as well. From this we can conclude that the growth of Slovenia's 
scientific collaboration network is governed by near-linear preferential attachment, which translates fairly accurately 
into the expected log-normal distribution of collaborators per author. Indeed, near-linea r attachment ra tes of the form 
A(y) ~ y/(l -H clny) yield log-normal distributions as depicted in panel A of Fig. [3] (lRednei . 2005 ). thus giving a 
coherent ending to our study. 
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4. Summary 



In sum, we have studied the growth and structure of Slovenia's scientific collaboration network over the past fifty 
years, focusing specifically on testing the "small-world" and preferential attachment hypotheses, but analyzing also 
other aspect of the network structure in detail. We have shown that there exists a tipping point in time after which 
the mean distance between authors and the diameter start decreasing. Accompanied by an exponential increase of the 
average number of collaborators per author and a near-exponential growth of the network over time, this gives rise to 
a logarithmic variation of the mean distance with size, much in agreement with what would be observed for growing 
random networks. Notably, the emergence of the tipping point coincides with the largest component exceeding 70% of 
the network size, but roughly also with the breakup of Yugoslavia and the subsequent downfall of the socialist regime. 
It is unclear whether the latter fact contributed to the emergence of the transition, but the time-wise correlation gives 
some room for speculations. Turning back to the network growth and structure, we have shown that the clustering 
coeflicient decreases in time, but by hoovering quite comfortably over 0.2 in the year 2010, it can be concluded that 
the clustering is still fairly strong. Finally, we have shown that the growth of the network is governed by near-linear 
preferential attachment that translates fairly accurately into the expected degree distribution, and that the year of 
becoming active scales inversely with the number o f collaborators thus far acquired. Although deviation s from linear 
preferential attachment have been reported before ('Newman', '2001a'; 'Barabasi et al.', '2002 Hjeong et all l2003h . also 
outside the realm of scientific collaboration networks (Redner, 2005; Capocci et al., 2006), an agreement between the 
attachment rate and the resulting distribution derived from empirical data seems to be quite rare. Altogether, however, 
the presented results are in agreement with previous findings, supporting the conclusion that the growth of social 
networks is governed by preferential attachment, and that the resulting network structure has properties characteristic 
for "small worlds". 
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